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Abstract. We describe a new pipeline used to analyze the data from the fifth science run 
(S5) of the LIGO detectors to search for continuous gravitational waves from isolated spinning 
neutron stars. The method employed is based on the Hough transform, which is a semi-coherent, 
computationally efficient, and robust pattern recognition technique. The Hough transform is 
used to find signals in the time-frequency plane of the data whose frequency evolution fits the 
pattern produced by the Doppler shift imposed on the signal by the Earth's motion and the 
pulsar's spin-down during the observation period. The main differences with respect to previous 
Hough all-sky searches are described. These differences include the use of a two-step hierarchical 
Hough search, analysis of coincidences among the candidates produced in the first and second 
year of S5, and veto strategies based on a ^ test. 



1. Introduction 

Spinning neutron stars are the most promising sources of continuous gravitational wave signals 
for ground-based interferometers such as GEO600 [1], LIGO [2] and VIRGO [3]. Using data 
from the different science runs, there have been two kinds of searches for gravitational waves 
from pulsars: i) targeted searches [H El El El Ej for periodic gravitational radiation from pulsars 
whose parameters (sky-position and frequency evolution) are known through radio observations, 
and ii) searches for pulsars yet unobserved by radio telescopes [9l [lOl [TTl [12l [131 El- For 
this second kind of search the optimal method based on a coherent integration over the full 
observation time is computationally prohibitive for a wide-parameter search. Therefore all-sky 
searches require the use of semi-coherent techniques [151 111 [13 [HI [HI [II] ■, that are less sensitive 
for the same observation time but are computationally inexpensive, or hierarchical approaches 
that combine both methods, optimizing the sensitivity of a search for a given computational 
power [201 EHl IHl 1121 [E]. Some of these semi-coherent methods operate on successive Short 
Fourier Transforms (SFTs) of the measured strain [23] searching for cumulative excess power 
from hypothetical periodic gravitational wave signal taking into account the Doppler modulation 
of the detected frequency due to the Earth's rotational and orbital motion with respect to the 
Solar System Barycenter, and the time derivative of the frequency intrinsic to the source. The 
Hough transform [10^ ITTl [151 U6] is an example of such a method. 

Two flavors of the Hough transform have been developed and employed for different searches. 
The 'standard Hough' [l0l|T5], computes the cumulative excess power as a sum of binary zeroes 
and ones, where a SFT contributes a one if and only if the power exceeds a normalized power 



threshold, and the 'weighted Hough' [111 [T7] in which the contribution of each SFT is weighted 
according to the noise and detector antenna pattern to maximize the signal-to-noise ratio. Using 
the 'weighted Hough' has two notable benefits: i) the gain in sensitivity, and ii) that this method 
can be used to analyze data from multiple detectors, since the different weights automatically 
take into account the different sensitivities. 

This paper describes the pipeline employed to analyze the data from the fifth science run 
of the LIGO detectors to search for periodic gravitational waves using a hierarchical Hough 
transform, emphasizing the changes with respect to previous Hough searches [lUl lllj . The 
paper is organized as follows: In section [2] we briefly summarize the basic principles of the 
Hough transform and its statistics. In section [3] we give an overview of the search pipeline and 
comment on the data employed and the parameter space of the search. In section |3] we describe 
two main features of our two-step hierarchical search, and we conclude in section [SJ 



2. The Hough transform 

The starting point for the Hough transform is a set of N SFTs, short stretches of Fourier 
transformed data which are digitized by setting a threshold pth on the normalized power in 
the different frequency bins: 
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Here Xk is the value of the Fourier transform in the fc*^ frequency bin corresponding to a 
frequency fk, T^oh is the time baseline of the SFT (in our case T^^h = 1800 s), and Sn{fk) is the 
single sided power spectral density of the detector noise at the frequency fk ■ 

For a given template, the Hough number count is a weighted sum of the binary zeroes and 
ones, along the corresponding time-frequency pattern 
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where the weights are defined as 

(i) (i) th 

being ^^[^2 values of the beam pattern functions at the mid point of the i SFT. 

(i) 

Thus, we take a binary count nj, to have greater weight if SFT i has a lower noise floor or 
if, in the time-interval corresponding to this SFT, the beam pattern functions are larger for a 
particular point in the sky. 

The weight normalization is chosen according to 

N-l 

Y^wt^=N. (5) 
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With this normalization the Hough number count n lies within the range [0, A^]. Note that the 
sensitivity of the search is governed by the ratios of the different weights, not by the choice of 



overall scale. The robustness of the Hough transform method in the presence of strong transient 
disturbances is not compromised by using weights because each SFT contributes at most Wi 
(which is of order unity) to the final number count. 

The natural detection statistic is not the Hough number count n, but the significance of a 
number count, defined by: 



where n and a are the expected mean and standard deviation for pure noise. Values of s can be 
compared directly across different templates characterized by different weight distributions and 
a values. Furthermore, in the case of Gaussian noise, the relation between the significance and 
the false alarm probability a is given by: 

a = 0.5erfc(s/\/2) . (7) 

Setting a threshold on the significance would then identify interesting candidates. We refer the 
reader to [11] for further details. 

3. Description of the pipeline 

This paper presents a new pipeline used to analyze the data from the LIGO detectors to search 
for continuous gravitational waves from isolated spinning neutron stars. The LIGO detector 
network consists of two interferometers in Hanford Washington, one 4-km and another 2-km 
(HI and H2) and a 4-km interferometer in Livingston Louisiana (LI). The search described here 
is currently carried out over the entire sky using the data produced during LIGO's fifth science 
run (S5) that started on November 4, 2005 and ended on October 1, 2007, at initial LIGO's 
design sensitivity. Data from each of the three LIGO interferometers is used to perform the 
search. 

The starting point is a collection of SFTs generated directly from the calibrated data stream, 
using 30-minute intervals of data for which the interferometer is operating in what is known as 
science mode. With this requirement, we search 32295 SFTs from the first year of S5 (11402 
from HI, 12195 from H2 and 8698 from LI) and 35401 SFTs from the second year (12590 from 
HI, 12178 from H2 and 10633 from LI). 

The search is performed in the frequency range 50-1000 Hz and with the frequency's 
time derivative in the range —8.9 x 10~^^ Hz to zero, being those values limited by the 
computational cost of the search. We use a uniform grid spacing equal to the size of a SFT 
frequency bin, 6f = = 5.556 x 10~^ Hz. The resolution 6f is given by the smallest value 

of / for which the intrinsic signal frequency does not drift by more than a frequency bin during 
the total observation time T^^^: 6f = Sf /T^^^ ~ 1.8 x 10~^^ Hz s~^. This yields 51 spin-down 
values for each frequency. 6f is fixed to the same value for the search on the first and the second 
year of S5 data, b eing To]^a the value for the first year. The sky resolution, S9, is frequency 
dependent, with the number of templates increasing with frequency, as given by Eq.(4.14) of 
Ref. [T^. This yields a resolution of about 9.3 x 10~^ rad at 300 Hz, which corresponds to 
~ 1.5 X 10^ sky locations for the whole sky at that frequency. 

The key difference from previous searches is that, starting from 30 min SFTs, we perform 
a multi-interferometer search analyzing separately the two years of the S5 run, and we study 
coincidences among the source candidates produced by the first and second years of data. This 
is inspired by a similar coincidence search using VIRGO data [23]. Furthermore, we use a 
test adapted to the Hough transform searches, as described in [25], to veto potential candidates. 

The approach used to analyze each year of data is based on a two-step hierarchical search for 
continuous signals from isolated neutron stars as described below. In both steps, the weighted 
Hough transform is used to find signals whose frequency evolution fits the pattern produced 



by the Doppler shift and the spin-down in the time-frequency plane of the data. The search 
is done by sphtting the frequency range in 0.25 Hz bands and using the SFTs from multiple 
interferometers . 

In the first stage, we break up the sky into smaller patches with frequency dependent size. 
The size of the sky-patches ranges from ~ 0.4 rad x 0.4 rad at 50 Hz to ~ 0.07 rad x 0.07 rad at 
1 kHz. Ideally, to obtain the maximum increase in sensitivity, we should calculate the weights, 
based both on the noise and the antenna pattern, for each sky-location. In practice, we calculate 
the weights just once for the center of each sky-patch. In this first stage, we perform the Hough 
multi-interferometer search using the traditional look up table approach, that enormously reduces 
the computational cost (see \TE] for a detailed description). But limitations on the memory of 
the machines constrain the volume of data (i.e., the number of SFTs) that can be analyzed 
at once and the parameter space (e.g., size and resolution of the sky-patches and number of 
spin-down values) we can search over. For this reason, in this first stage, the Hough transform 
is not applied using all the available SFTs, but selecting the best 15000 SFTs for each frequency 
band and sky-patch. A top-list keeping the best 1000 candidates is produced for each 0.25 Hz 
band. 

In a second stage, we recompute the Hough significance of each candidate in the top-list 
using the complete set of available SFTs from all the interferometers, and at the same time we 
reduce the mismatch of the template, i.e., we calculate the number count and the corresponding 
significance without the roundings introduced by the look up table approach and by recomputing 
the weights for the precise sky location and not for the center of the corresponding patch, as 
previously done. For each candidate, we also compute the value that will help in vetoing the 
resulting candidates. 

4. A tvi^o-step hierarchical Hough search 

In this section we describe in more detail the two main features of the two-step hierarchical 
Hough search, i.e., the selection of the best SFT data and the comparison of the significance 
values produced in both steps. 

4-1- Selection of the best SFT data 

The first of the two main changes to the Hough code consists in selecting the best SFTs based 
on the weights given by Eq. (jj]). In this way, when doing the all-sky search for each 0.25 Hz 
band, for each sky-patch, we will keep the 15000 SFTs that have lower noise and that are more 
sensitive at that sky location. 

Figure [1] shows the percentage of SFTs from each of the three LIGO detectors (HI, H2 and 
LI) that have been selected when doing the Hough search on a 0.25 Hz band at a frequency of 
420 Hz for the S5 first year of data. At this frequency, the number of sky-patches is 426 with a 
size of ~ 0.17 rad x 0.17 rad. The figure shows the percentage of SFTs from each detector on 
each sky-patch. In this figure we can see that, for this particular frequency, the detector that 
contributes the most at almost all the sky locations is HI. The maximum contribution of HI, 
about 64%, is at the poles, LI has it maximum contribution, about 46%, around the equator 
and H2 contributes at most 23% of the SFTs in the region in between. 

The selection of SFTs has been done based on the total weights, that depend both on noise 
floor and antenna pattern. If this selection had been based only upon the weights due to the 
noise floor, the H2 detector would not have contributed at all in this flrst stage. By having the 
weights antenna pattern dependent, H2 has a certain contribution in some sky locations where 
it is more sensitive. 
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Figure 1. Percentage of SFTs that each 
detector has contributed at each particular sky- 
patch when selecting the best 15000 SFTs based 
on the Hough weights. These figures correspond 
to an ah-sky search performed on a 0.25 Hz band 
at 420 Hz for the S5 first year of data. 



4-2. Comparison of the significance values 

In the first stage of the hierarchical search, the Hough significance is computed using the look up 
table approach to enormously reduce the computational cost. The code uses the selected 15000 
best SFTs to compute the Hough significance and make a list of interesting candidates. 

In the second stage of the search, for each candidate stored in the top-list produced for each 
0.25 Hz band, the code recomputes the Hough significance without any rounding, and using all 
the SFTs from the three LIGO detectors. In this stage, the significance is computed directly 
from the number count that is obtained from the digitized time-frequency plane by summing 
weighted binary zeros or ones along the expected path of the frequency of a hypothetical periodic 
gravitational wave signal, taking into account the Doppler shift and the spin-down. 

Figure [2] shows how this recomputation affects the previous value of the significance in the 
case where an injected pulsar was present and in the case of a frequency band free of spectral 
disturbances. To illustrate the effect, we have used a 'quiet band' at 205.25 Hz and a band 
with an injected pulsar at 575 Hz. In both bands we perform an all-sky search and compare 
the significances computed on the first and second stages. The 'old significance' corresponds to 
the one computed in the first stage, using the look up table approach and using only the best 
15000 SFTs. The 'new significance' corresponds to the one computed in the second stage using 
all the SFTs and without any rounding. In the same figure we also provide the histograms of 
the differences between the 'old' and 'new' significances, showing that in the case where we have 
the hardware injected signal, the values of the 'new' significance are, in most cases, higher than 
the 'old' values. While in the case of the 'quiet' band, the 'new significance' becomes smaller 
than the 'old' one in the majority of the cases. 

5. Conclusions and future work 

In this paper, we have presented the improved Hough search pipeline, which is being used 
to analyze data from the fifth science run of the LIGO interferometers, describing in detail 
the two main features of the two-step hierarchical search, i.e., the selection of SFT data and 
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Figure 2. Comparison of tiie distribution of the significance values when calculated in the first 
stage using the look up table approach ('old significance') and after recomputing its value using 
all the SFTs and without any rounding ('new significance'). These results correspond to an 
all-sky search on a 0.25 Hz band at a frequency of 205.25 Hz for the S5 2nd year of data (top 
panels), where no relevant spectral disturbances were present, and at 575 Hz for the 1st year of 
data (bottom panels), containing a hardware injected pulsar. 



recomputation of the significance values. We have shown how these improvements perform on 
the data, showing examples at some particular frequencies, either with interesting artifacts, such 
as hardware injected pulsars, or others where we expect approximately Gaussian noise. Work is 
in progress to analyze the coincidences among the candidates produced in the first and second 
year of S5 for the full parameter space, and to compute astrophysical upper limits using the 
search pipeline presented in this paper. 
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